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A Complete Characterization of the Gap between 
Convexity and SOS- Convexity 

Amir Ali Ahmadi and Pablo A. Parrilo ^ 



Abstract 



^ . Our first contribution in this paper is to prove that three natural sum of squares (sos) based 

sufficient conditions for convexity of polynomials, via the definition of convexity, its first order 
*n , characterization, and its second order characterization, are equivalent. These three equivalent 

algebraic conditions, henceforth referred to as sos-convexity, can be checked by semidefinite 
programming whereas deciding convexity is NP-hard. If we denote the set of convex and sos- 
' convex polynomials in n variables of degree d with C n ,d and £C„.d respectively, then our main 

contribution is to prove that C n ,d = ^C n ,d if and only if n — I or d — 2 or (n, d) — (2, 4). We 
also present a complete characterization for forms (homogeneous polynomials) except for the 
case (n, d) = (3,4) which is joint work with G. Blckherman and is to be published elsewhere. 
Our result states that the set C n ,d of convex forms in n variables of degree d equals the set 
EC„ ;( i of sos-convex forms if and only if n — 2 or d = 2 or (n, d) = (3, 4). To prove these results, 
we present in particular explicit examples of polynomials in C^^ \ £C*2,6 and \ £(^3,4 
£Sj ■ and forms in 63,6 \ £03,6 and 64,4 \ £6*4,4, and a general procedure for constructing forms in 

^ \ C n ^d+2 \ ^C ni d+2 from nonnegative but not sos forms in n variables and degree d. 

Although for disparate reasons, the remarkable outcome is that convex polynomials (resp. forms) 
are sos-convex exactly in cases where nonnegative polynomials (resp. forms) are sums of squares, 
as characterized by Hilbcrt. 



1 Introduction 

1.1 Nonnegativity and sum of squares 

• <— ^ ■ 

One of the cornerstones of real algebraic geometry is Hilbert's seminal paper in 1888 [22], where he 

gives a complete characterization of the degrees and dimensions for which nonnegative polynomials 
can be written as sums of squares of polynomials. In particular, Hilbert proves in [22] that there 
exist nonnegative polynomials that are not sums of squares, although explicit examples of such 
polynomials appeared only about 80 years later and the study of the gap between nonnegative and 
sums of squares polynomials continues to be an active area of research to this day. 

Motivated by a wealth of new applications and a modern viewpoint that emphasizes efficient 
computation, there has also been a great deal of recent interest from the optimization community in 
the representation of nonnegative polynomials as sums of squares (sos). Indeed, many fundamental 
problems in applied and computational mathematics can be reformulated as either deciding whether 
certain polynomials are nonnegative or searching over a family of nonnegative polynomials. It is 
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well-known however that if the degree of the polynomial is four or larger, deciding nonnegativity is 
an NP-hard problem (this follows, e.g., as an immediate corollary of NP-hardness of deciding matrix 
copositivity |30]). On the other hand, it is also well-known that deciding whether a polynomial can 
be written as a sum of squares can be reduced to solving a semidefinite program, for which efficient 
algorithms, e.g. based on interior point methods, are available. The general machinery of the 
so-called "sos relaxation" has therefore been to replace the intractable nonnegativity requirements 
with the more tractable sum of squares requirements that obviously provide a sufficient condition 
for polynomial nonnegativity. 

Some relatively recent applications that sum of squares relaxations have found span areas as 
diverse as control theory [32], [21], quantum computation [17] . polynomial games [34], combinatorial 
optimization [19] . and many others. 

1.2 Convexity and sos-convexity 

Aside from nonnegativity, convexity is another fundamental property of polynomials that is of 
both theoretical and practical significance. Perhaps most notably, presence of convexity in an 
optimization problem often leads to tractability of finding global optimal solutions. Consider for 
example the problem of finding the unconstrained global minimum of a polynomial. This is an 
NP-hard problem in general [35] . but if we know a priori that the polynomial to be minimized is 
convex, then every local minimum is global, and even simple gradient descent methods can find a 
global minimum. There are other scenarios where one would like to decide convexity of polynomials. 
For example, it turns out that the d-th root of a degree d polynomial is a norm if and only if the 
polynomial is homogeneous, positive definite, and convex [J2]. Therefore, if we can certify that a 
homogenous polynomial is convex and definite, then we can use it to define a norm, which is useful 
in many applications. In many other practical settings, we might want to parameterize a family of 
convex polynomials that have certain properties, e.g., that serve as a convex envelope for a non- 
convex function, approximate a more complicated function, or fit some data points with minimum 
error. In the field of robust control for example, it is common to use convex Lyapunov functions to 
prove stability of uncertain dynamical systems described by difference inclusions. Therefore, the 
ability to efficiently search over convex polynomials would lead to algorithmic ways of constructing 
such Lyapunov functions. 

The question of determining the computational complexity of deciding convexity of polynomials 
appeared in 1992 on a list of seven open problems in complexity theory for numerical optimiza- 
tion [31. In a recent joint work with A. Olshevsky and J. N. Tsitsiklis, we have shown that the 
problem is strongly NP-hard even for polynomials of degree four [2]. If testing membership in the 
set of convex polynomials is hard, searching or optimizing over them is obviously also hard. This 
result, like any other hardness result, stresses the need for good approximation algorithms that can 
deal with many instances of the problem efficiently. 

The focus of this work is on an algebraic notion known as sos-convexity (introduced formally 
by Helton and Nie in [20]), which is a sufficient condition for convexity of polynomials based on a 
sum of squares decomposition of the Hessian matrix; see Definition 12.41 As we will briefly review in 
Section [2j the problem of deciding if a given polynomial is sos-convex amounts to solving a single 
semidefinite program. 

Besides its computational implications, sos-convexity is an appealing concept since it bridges 
the geometric and algebraic aspects of convexity. Indeed, while the usual definition of convexity is 
concerned only with the geometry of the epigraph, in sos-convexity this geometric property (or the 
nonnegativity of the Hessian) must be certified through a "simple" algebraic identity, namely the 
sum of squares factorization of the Hessian. The original motivation of Helton and Nie for defining 
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sos-convexity was in relation to the question of semidefinite representability of convex sets [20] . But 
this notion has already appeared in the literature in a number of other settings [23], [25] . [28], 
In particular, there has been much recent interest in the role of convexity in semialgebraic geometry 
[24] . [7], [15], [26] and sos-convexity is a recurrent figure in this line of research. 

1.3 Contributions and organization of the paper 

The main contribution of this work is to establish the counterpart of Hilbert's characterization of 
the gap between nonnegativity and sum of squares for the notions of convexity and sos-convexity. 
We start by presenting some background material in Section [2j In Section [3l we prove an algebraic 
analogue of a classical result in convex analysis, which provides three equivalent characterizations 
for sos-convexity (Theorem 13. ip . This result substantiates the fact that sos-convexity is the right 
sos relaxation for convexity. In Section HI we present some examples of convex polynomials that 
are not sos-convex. In Section we provide the characterization of the gap between convexity 
and sos-convexity ( Theorem 1 5 . 1 1 and Theorem 15. 2p . Subsection 15.11 includes the proofs of the cases 
where convexity and sos-convexity are equivalent and Subsection 15.21 includes the proofs of the 
cases where they are not. In particular, Theorem 15.81 and Theorem 15.91 present explicit examples 
of convex but not sos-convex polynomials that have dimension and degree as low as possible, and 
Theorem 15.101 provides a general construction for producing such polynomials in higher degrees. 
Some concluding remarks and an open problem are presented in Section [6l 

2 Preliminaries 

2.1 Background on nonnegativity and sum of squares 

A (multivariate) polynomial p := p(x) in variables x '■= (x\, . . . ,x n ) T is a function from R n to R 
that is a finite linear combination of monomials: 

p(x) = ^ ^ C a X = ^ ] Cai,...,an x i 1 ' ' ' V j 
a (cti,. ..,a n ) 

where the sum is over ra-tuples of nonnegative integers a '■= (a±, . . . ,a n ). We will be concerned 
throughout with polynomials with real coefficients, i.e., we will have c a € R. The ring of polynomials 
in n variables with real coefficients is denoted by M[x]. The degree of a monomial x a is equal to 
ct\ + ■ • ■ + ct n . The degree of a polynomial p € R[x] is defined to be the highest degree of its 
component monomials. A polynomial p is said to be nonnegative or positive semidefinite (psd) if 
p(x) > for all x £ R n . Clearly, a necessary condition for a polynomial to be psd is for its degree 
to be even. We say that p is a sum of squares (sos), if there exist polynomials qi,...,q m such that 
p = Yli^i Qi- We denote the set of psd (resp. sos) polynomials in n variables and degree d by P n ^ 
(resp. £ n) d). Any sos polynomial is clearly psd, so we have E nj( i C P n>t j. 

A homogeneous polynomial (or a form) is a polynomial where all the monomials have the same 
degree. A form p of degree d is a homogeneous function of degree d since it satisfies p(Xx) = X d p(x) 
for any scalar A 6 R. We say that a form p is positive definite if p(x) > for all x 7^ in R™. 
Following standard notation, we denote the set of psd (resp. sos) homogeneous polynomials in n 
variables and degree d by P n ^ (resp. £ n ,d). Once again, we have the obvious inclusion C P nt d- 
All of the four sets E n ^, P n ,di ^n,di Pn,d are closed convex cones. The closedness of the sum of 
squares cone may not be so obvious. This fact was first proved by Robinson [43]. We will make 
crucial use of it in the proof of Theorem 13.11 in the next section. 
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Any form of degree d in n variables can be "dehomogenized" into a polynomial of degree < d 
in n — 1 variables by setting x n = 1. Conversely, any polynomial p of degree d in n variables can 
be "homogenized" into a form ph of degree d in n + 1 variables, by adding a new variable y, and 
letting ph(xi, . . . , x n ,y) := (%i/y, ■ ■ ■ , x n /y). The properties of being psd and sos are preserved 
under homogenization and dehomogenization [4TJj . 

A very natural and fundamental question that as we mentioned earlier was answered by Hilbert 
is to understand for what dimensions and degrees nonnegative polynomials (or forms) can always be 
represented as sums of squares, i.e, for what values of n and d we have £ nj( i = P n ^ or T, n ^ = P n; d- 
Note that because of the argument in the last paragraph, we have £ n d = P n ,d if an d only if 
£n+l,d = Pn+l,d- Hence, it is enough to answer the question just for polynomials or just for forms. 

Theorem 2.1 (Hilbert, [22]). £ n>d = P nyd if and only if n = 1 or d = 2 or (n,d) = (2,4). 
Equivalently, £ nj d = P n ^d if and only if n = 2 or d = 2 or (n,d) = (3, 4). 

The proofs of £i >( 2 = P\^ and £ n] 2 = Pn,2 are relatively simple and were known before Hilbert. 
On the other hand, the proof of the fairly surprising fact that £2,4 — -^2,4 (or equivalently £3^ = 
^3,4) is more involved. We refer the interested reader to [37], [36], [13], and references in [40] for 
some modern expositions and alternative proofs of this result. Hilbert's other main contribution 
was to show that these are the only cases where nonnegativity and sum of squares are equivalent 
by giving a nonconstructive proof of existence of polynomials in P2,6 \ £2,6 an d ^3,4 \ £3,4 (or 
equivalently forms in P3 i 6\£3,6 and i 3 4,4\£4,4). From this, it follows with simple arguments that in 
all higher dimensions and degrees there must also be psd but not sos polynomials; see |40j . Explicit 
examples of such polynomials appeared in the 1960s starting from the celebrated Motzkin form |29j : 

M(xi,X2,Xs) = x\x\ + x\x\ — Zx\x\x\ + X3, (1) 

which belongs to P^q \ £3,6, and continuing a few years later with the Robinson form [43] : 

R(xi, X 2 , X3, 2:4) = x\{xi - X 4 ) 2 + X\{X2 - X 4 ) 2 + Xg(x 3 - X 4 ) 2 + 2xiX2X 3 (xi +X2 + X3- 2x 4 ), (2) 

which belongs to P^ \ £4,4. 

Several other constructions of psd polynomials that are not sos have appeared in the literature 
since. An excellent survey is [40]. See also [41] and [8]. 

2.2 Connection to semidefinite programming and matrix generalizations 

As we remarked before, what makes sum of squares an appealing concept from a computational 
viewpoint is its relation to semidefinite programming. It is well-known (see e.g. [32], [33]) that a 
polynomial p in n variables and of even degree d is a sum of squares if and only if there exists a 
positive semidefinite matrix Q (often called the Gram matrix) such that 

p(x) = z T Qz, 

where z is the vector of monomials of degree up to d/2 

z = [l,xi,x 2 , • • . ,x n ,xix 2 , . . • ,x^ /2 ]. (3) 

The set of all such matrices Q is the feasible set of a semidefinite program (SDP). For fixed d, 
the size of this semidefinite program is polynomial in n. Semidefinite programs can be solved 
with arbitrary accuracy in polynomial time. There are several implementations of semidefinite 
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programming solvers, based on interior point algorithms among others, that are very efficient in 
practice and widely used; see [39] and references therein. 

The notions of positive semidefiniteness and sum of squares of scalar polynomials can be natu- 
rally extended to polynomial matrices, i.e., matrices with entries in M[x]. We say that a symmetric 
polynomial matrix U(x) G R[x] mxm is positive semidefinite if U(x) is positive semidefinite in the 
matrix sense for all x G M. n , i.e, if U{x) has nonnegative eigenvalues for all x G W 1 . It is straight- 
forward to see that this condition holds if and only if the scalar polynomial y T U(x)y in m + n 
variables [x;y] is psd. A homogeneous polynomial matrix U(x) is said to be positive definite, if 
it is positive definite in the matrix sense, i.e., has positive eigenvalues, for all x ^ in BL n . The 
definition of an sos-matrix is as follows [23], [18], [45] , 

Definition 2.2. A symmetric polynomial matrix U(x) G R[x] mxm , x G M. n , is an sos-matrix if 
there exists a polynomial matrix V(x) G K[x] sxm for some s G N, such that P(x) = V T (x)V(x). 

It turns out that a polynomial matrix U (x) G ]R[x] mxm , x G M. n , is an sos-matrix if and only if 
the scalar polynomial y T U(x)y is a sum of squares in M[x;y]; see [23]. This is a useful fact because 
in particular it gives us an easy way of checking whether a polynomial matrix is an sos-matrix by 
solving a semidefinite program. Once again, it is obvious that being an sos-matrix is a sufficient 
condition for a polynomial matrix to be positive semidefinite. 

2.3 Background on convexity and sos-convexity 

A polynomial p is (globally) convex if for all x and y in M n and all A G [0, 1], we have 



Since polynomials are continuous functions, the inequality in (jl]) holds if and only if it holds for a 
fixed value of A G (0, 1), say, A = \. In other words, p is convex if and only if 



for all x and y; see e.g. [441 p. 71]. Except for the trivial case of linear polynomials, an odd degree 
polynomial is clearly never convex. 

For the sake of direct comparison with a result that we derive in the next section (Theorem l3.ip . 
we recall next a classical result from convex analysis on the first and second order characterization 
of convexity. The proof can be found in many convex optimization textbooks, e.g. [101 p. 70]. The 
theorem is of course true for any twice differentiable function, but for our purposes we state it for 
polynomials. 

Theorem 2.3. Let p := p(x) be a polynomial. Let Vp := Vp(x) denote its gradient and let 
H := H(x) be its Hessian, i.e., the n x n symmetric matrix of second derivatives. Then the 
following are equivalent. 

(a) p {r^x + iy) < \p{x) + \p{y), Vx, y G W 1 ; (i.e., p is convex). 

(b) p(y) > p(x) + Vp{x) T (y — x), Vx,y G M. n ; (i.e., p lies above the supporting hyperplane at 
every point). 

(c) y T H(x)y > 0, Vx, y G W 1 ; (i.e., H(x) is a positive semidefinite polynomial matrix). 

Helton and Nie proposed in [20] the notion of sos-convexity as an sos relaxation for the second 
order characterization of convexity (condition (c) above). 

Definition 2.4. A polynomial p is sos-convex if its Hessian H := H(x) is an sos-matrix. 



p(Xx + (1 - X)y) < Xp(x) + (1 - A)p(y). 



(4) 




(5) 
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With what we have discussed so far, it should be clear that sos-convexity is a sufficient condition 
for convexity of polynomials and can be checked with semidefinite programming. In the next section, 
we will show some other natural sos relaxations for polynomial convexity, which will turn out to 
be equivalent to sos-convexity. 

We end this section by introducing some final notation: C n ^ and XC n ^ will respectively denote 
the set of convex and sos-convex polynomials in n variables and degree d; C n ^ and £C n]( 2 will 
respectively denote set of convex and sos-convex homogeneous polynomials in n variables and degree 
d. Again, these four sets are closed convex cones and we have the obvious inclusions SC ni d C C n ^d 
and EC re)d C C n>d . 



3 Equivalent algebraic relaxations for convexity of polynomials 

An obvious way to formulate alternative sos relaxations for convexity of polynomials is to replace 
every inequality in Theorem 12.31 with its sos version. In this section we examine how these relax- 
ations relate to each other. We also comment on the size of the resulting semidefinite programs. 
Our result below can be thought of as an algebraic analogue of Theorem [ 



Theorem 3.1. Let p := p(x) be a polynomial of degree d in n variables with its gradient and 
Hessian denoted respectively by Vp := Vp{x) and H := H{x). Let g\, gy , and gy2 be defined as 

9\(x,y) = (1 - X)p(x) + \p{y) -p((l - X)x + Xy), 

9v(x,y) = p(y) -p{x) - Vp(x) T (y - x), (6) 
gs/z(x,y) = y T H(x)y. 

Then the following are equivalent: 

(a) gi(x,y) is sofl 

(b) gsj{x,y) is sos. 

(c) gy2(x,y) is sos; (i.e., H{x) is an sos-matrix). 

Proof. (a)=^(b): Assume gi is sos. We start by proving that gj_ will also be sos for any integer 

2 2 fc 

k > 2. A little bit of straightforward algebra yields the relation 

9-^ 0, y) = \9^ (x, y) + gi (x, ^-x + ^yj . (7) 
The second term on the right hand side of (I7|) is always sos because gi is sos. Hence, this relation 

2 

shows that for any k, if gj_ is sos, then so is g 1 . Since for k = 1, both terms on the right hand 

2 k 2 k + 1 

side of ([7]) are sos by assumption, induction immediately gives that g i is sos for all k. 

~2F 

Now, let us rewrite g\ as 

9\(x, y) = p{x) + X(p{y) - p(x)) - p(x + X(y - x)). 

We have 

= P (y) -p(x) - p(x + A( ^ x)) " p(x) . (8) 

Next, we take the limit of both sides of (JH]) by letting A = ^ — > as /c — > oo. Because p is 
differ entiable, the right hand side of (JH]) will converge to g\j. On the other hand, our preceding 



The constant | in g 1 (x, y) of condition (a) is arbitrary and chosen for convenience. One can show that g 1 being 
sos implies that g\ is sos for any fixed A € [0, 1]. Conversely, if g\ is sos for some A £ (0, 1), then gi is sos. The 
proofs are similar to the proof of (a)=>(b). 
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argument implies that ^ is an sos polynomial (of degree d in 2n variables) for any A = p-. 
Moreover, as A goes to zero, the coefficients of ^ remain bounded since the limit of this sequence 
is 5V; which must have bounded coefficients (see ©). By closedness of the sos cone, we conclude 
that the limit gy must be sos. 

(b)=>(a): Assume gy is sos. It is easy to check that 

£i (x, y) = \av {\x + \y, x) + \gy (\x + \y, y) , 

and hence gi is sos. 

2 

(b) =^(c): Let us write the second order Taylor approximation of p around x: 

p(y)= p{x) + V T p{x)(y - x) 

+\{y - x) T H(x)(y - x) + o(\\y - x\\ 2 ). 

After rearranging terms, letting y = x + ez (for e > 0), and dividing both sides by e 2 we get: 

(p(x + ez) -p(x))/e 2 - V T p{x)z/e = ^z T H(x)z + l/e 2 o(e 2 | \z\ | 2 ). (9) 

The left hand side of ([9]) is gy(x, x + ez)/e 2 and therefore for any fixed e > 0, it is an sos polynomial 
by assumption. As we take e — >• 0, by closedness of the sos cone, the left hand side of ([9]) converges 
to an sos polynomial. On the other hand, as the limit is taken, the term 4ro(e 2 |M| 2 ) vanishes and 
hence we have that z T H(x)z must be sos. 

(c) =>(b): Following the strategy of the proof of the classical case in |48j p. 165], we start by 
writing the Taylor expansion of p around x with the integral form of the remainder: 

P(y) =p(x) + V T p{x){y-x) + [ (l-t)(y-x) T H(x + t(y-x))(y-x)dt. (10) 

J o 

Since y T H(x)y is sos by assumption, for any t G [0, 1] the integrand 

(1 -t)(y- x) T H(x + t(y - x))(y - x) 
is an sos polynomial of degree d in x and y. From (I10p we have 

9V= f (l-t)(y-x) T H(x + t(y-x))(y-x)dt. 
Jo 

It then follows that gv is sos because integrals of sos polynomials, if they exist, are sos. To see 
the latter fact, note that we can write the integral as a limit of a sequence of Riemann sums by 
discretizing the interval [0, 1] over which we are integrating. Since every finite Riemann sum is an 
sos polynomial of degree d, and since the sos cone is closed, it follows that the limit of the sequence 
must be sos. 

□ 

We conclude that conditions (a), (b), and (c) are equivalent sufficient conditions for convexity 
of polynomials, and can each be checked with a semidefinite program as explained in Subsection [221 
It is easy to see that all three polynomials gi(x,y), g\7(x,y), and gy2(x,y) are polynomials in 2n 

2 

variables and of degree d. (Note that each differentiation reduces the degree by one.) Each of these 
polynomials have a specific structure that can be exploited for formulating smaller SDPs. For 
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example, the symmetries gi(x,y) = gi(y,x) and g\/2(x, —y) = gy2(x,y) can be taken advantage of 

2 2 

via symmetry reduction techniques developed in |18j . 

The issue of symmetry reduction aside, we would like to point out that formulation (c) (which 
was the original definition of sos-convexity) can be significantly more efficient than the other two 
conditions. The reason is that the polynomial g^2(x,y) is always quadratic and homogeneous in 
y and of degree d — 2 in x. This makes g^2(x,y) much more sparse than gy{x,y) and g^2(x,y), 
which have degree d both in x and in y. Furthermore, because of the special bipartite structure of 
y T H(x)y, only monomials of the form x a yi (i.e., linear in y) will appear in the vector of monomials 
([3]). This in turn reduces the size of the Gram matrix, and hence the size of the SDP. It is 
perhaps not too surprising that the characterization of convexity based on the Hessian matrix is a 
more efficient condition to check. After all, at a given point x, the property of having nonnegative 
curvature in every direction is a local condition, whereas characterizations (a) and (b) both involve 
global conditions. 

Remark 3.1. There has been yet another proposal for an sos relaxation for convexity of polynomials 
in [II]. However, we have shown in [4j that the condition in pT] is at least as conservative as the 
three conditions in Theorem 13.11 and also significantly more expensive to check. 

Remark 3.2. Just like convexity, the property of sos-convexity is preserved under restrictions to 
affine subspaces. This is perhaps most directly seen through characterization (a) of sos-convexity 
in Theorem l3.H by also noting that sum of squares is preserved under restrictions. Unlike convexity 
however, if a polynomial is sos-convex on every line (or even on every proper affine subspace), this 
does not imply that the polynomial is sos-convex. 

As an application of Theorem 13. 1\ we use our new characterization of sos-convexity to give a 
short proof of an interesting lemma of Helton and Nie. 

Lemma 3.2. (Helton and Nie [20^ Lemma 8]). Every sos-convex form is sos. 

Proof. Let p be an sos-convex form of degree d. We know from Theorem 13.11 that sos-convexity of 
p is equivalent to the polynomial gi(x,y) = \p(x) + \p(y) — p (\x + \y) being sos. But since sos 
is preserved under restrictions and p(0) = 0, this implies that 

gi(x,0) = \p(x)-p(\x) = Q - {\) d )p(x) 
is sos. □ 
Note that the same argument also shows that convex forms are psd. 

4 Some constructions of convex but not sos-convex polynomials 

It is natural to ask whether sos-convexity is not only a sufficient condition for convexity of poly- 
nomials but also a necessary one. In other words, could it be the case that if the Hessian of a 
polynomial is positive semidefinite, then it must factor? To give a negative answer to this question, 
one has to prove existence of a convex polynomial that is not sos-convex, i.e, a polynomial p for 
which one (and hence all) of the three polynomials gi,ffv, and g^/2 in ([6]) are psd but not sos. 
Note that existence of psd but not sos polynomials does not imply existence of convex but not 
sos-convex polynomials on its own. The reason is that the polynomials gi,g\7, and g^2 all possess 

a very special structured For example, y T H(x)y has the structure of being quadratic in y and a 

2 There are many situations where requiring a specific structure on polynomials makes psd equivalent to sos. As an 
example, we know that there are forms in \ £4,4. However, if we require the forms to have only even monomials, 
then all such nonnegative forms in 4 variables and degree 4 are sums of squares |16| . 
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Hessian in x. (Not every polynomial matrix is a valid Hessian.) The Motzkin or the Robinson 
polynomials in ([1]) and ([2]) for example are clearly not of this structure. 

In an earlier paper, we presented the first example of a convex but not sos-convex polyno- 
mial [5]. [3pl : 

p(xi,x 2 ,x 3 ) = 32xf + 118x^x1 + 40x1x1 + 25xfx% - 43xfx|x| - 35xfx| + 3xfxjxj 

(11) 

-\§x\x\x\ + 24x\x% + 16x§ + 44x§z§ + 70x|x| + 60x|x[j + 30x§. 

As we will see later in this paper, this form which lives in C%$\YjCj,$ turns out to be an example in 
the smallest possible number of variables but not in the smallest degree. We next present another 
example of a convex but not sos-convex form that has not been previously in print. The example 
is in 6*6,4 \ SC6,4 and by contrast to the previous example, it will turn out to be minimal in the 
degree but not in the number of variables. What is nice about this example is that unlike the other 
examples in this paper it has not been derived with the assistance of a computer and semidefinite 
programming: 

q(xi, . . . ,x e ) = x\ + x\ + x\ + x\ + x\ + x\ 

+9/ ^2 ™2 i ^2^2 i ™2™2 i ™2^2 i ™2™2 i , T ,2,~,2\ 

(12) 

+ 1 { /y.2 ~,2 1 ™2™2 1 ~.2~,2\ 1^ ^,2^2 ™2™2 ^2^.2 
-(X1X2X4X5 + XiX-iX^XQ + X 2 X 3 X 5 Xq). 

The proof that this polynomial is convex but not sos-convex can be extracted from [21 Thm. 2.3 
and Thm. 2.5]. In there, a general procedure is described for producing convex but not sos-convex 
quartic forms from any example of a psd but not sos biquadratic form. The biquadratic form that 
has led to the form above is that of Choi in [12]. 

Also note that the example in (|12|) shows that convex forms that possess strong symmetry 
properties can still fail to be sos-convex. The symmetries in this form are inherited from the rich 
symmetry structure of the biquadratic form of Choi (see [18]). In general, symmetries are of interest 
in the study of positive semidefinite and sums of squares polynomials because the gap between psd 
and sos can often behave very differently depending on the symmetry properties; see e.g. [9]. 

5 Characterization of the gap between convexity and sos-convexity 

Now that we know there exist convex polynomials that are not sos-convex, our final and main goal 
is to give a complete characterization of the degrees and dimensions in which such polynomials can 
exist. This is achieved in the next theorem. 

Theorem 5.1. SC ni d = C n d if and only if n = 1 or d = 2 or (n,d) = (2,4). 

We would also like to have such a characterization for homogeneous polynomials. Although 
convexity is a property that is in some sense more meaningful for nonhomogeneous polynomials 
than for forms, one motivation for studying convexity of forms is in their relation to norms [42] . 

3 Assuming P^NP, and given the NP-hardness of deciding polynomial convexity [2J, one would expect to see convex 
polynomials that are not sos-convex. However, our first example in [5] appeared before the proof of NP-hardness [2j. 
Moreover, from complexity considerations, even assuming P=^NP, one cannot conclude existence of convex but not 
sos-convex polynomials for any finite value of the number of variables n. 
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Also, in view of the fact that we have a characterization of the gap between nonnegativity and 
sums of squares both for polynomials and for forms, it is very natural to inquire the same result 
for convexity and sos-convexity. The next theorem presents this characterization for forms. 

Theorem 5.2. £C nj d = C n ^ if and only if n = 2 or d = 2 or (n,d) = (3,4). 

The result T1C34 = C3 4 of this theorem is joint work with G. Blekherman and is to be presented 
in full detail in pp. The remainder of this paper is solely devoted to the proof of Theorem 15. II and 
the proof of Theorem 15.21 except for the case (n,d) = (3,4). Before we present these proofs, we 
shall make two important remarks. 

Remark 5.1. Difficulty with homogenization and dehomogenization. Recall from Subsec- 
tion 12.11 and Theorem 12.11 that characterizing the gap between nonnegativity and sum of squares 
for polynomials is equivalent to accomplishing this task for forms. Unfortunately, the situation 
is more complicated for convexity and sos-convexity and that is the reason why we are present- 
ing Theorems 15.11 and 15.21 as separate theorems. The difficulty arises from the fact that unlike 
nonnegativity and sum of squares, convexity and sos-convexity are not always preserved under 
homogenization. (Or equivalently, the properties of being not convex and not sos-convex are not 
preserved under dehomogenization.) In fact, any convex polynomial that is not psd will no longer 
be convex after homogenization. This is because convex forms are psd but the homogenization of 
a non-psd polynomial is a non-psd form. Even if a convex polynomial is psd, its homogenization 
may not be convex. For example the univariate polynomial 10xf — 5xi + 2 is convex and psd, 
but its homogenization lOxf — 5xix^ + 2x\ is not convexQ To observe the same phenomenon for 
sos-convexity, consider the trivariate form p in (II ip which is convex but not sos-convex and define 
p(x2,xs) = p(l, X2, X3). Then, one can check that p is sos-convex (i.e., its 2 x 2 Hessian factors) 
even though its homogenization which is p is not sos-convex [5] . 

Remark 5.2. Resemblance to the result of Hilbert. The reader may have noticed from the 
statements of Theorem 12.11 and Theorems 15.11 and 15.21 that the cases where convex polynomials 
(forms) are sos-convex are exactly the same cases where nonnegative polynomials are sums of 
squares! We shall emphasize that as far as we can tell, our results do not follow (except in the 
simplest cases) from Hilbert 's result stated in Theorem 12.11 Note that the question of convex- 
ity or sos-convexity of a polynomial p(x) in n variables and degree d is about the polynomials 

gi(x,y), gx?(x,y), or gy2(x,y) defined in Q being psd or sos. Even though these polynomials still 

2 

have degree d, it is important to keep in mind that they are polynomials in In variables. Therefore, 
there is no direct correspondence with the characterization of Hilbert. To make this more explicit, 
let us consider for example one particular claim of Theorem 15.21 = ^2,4- For a form p in 

2 variables and degree 4, the polynomials <?i,5V) an d <?v 2 wu l De forms in 4 variables and degree 

2 

4. We know from Hilbert's result that in this situation psd but not sos forms do in fact exist. 

However, for the forms in 4 variables and degree 4 that have the special structure of gi,g\7, or <?y2, 

2 

psd turns out to be equivalent to sos. 

The proofs of Theorems 1 5 . 1 1 and 15 . 2 1 are broken into the next two subsections. In Subsection l5.il 
we provide the proofs for the cases where convexity and sos-convexity are equivalent. Then in 
Subsection 15. 2\ we prove that in all other cases there exist convex polynomials that are not sos- 
convex. 

4 What is true however is that a nonnegative form of degree d is convex if and only if the d-th root of its deho- 
mogenization is a convex function |42l Prop. 4.4]. 
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5.1 Proofs of Theorems 15.11 and 15.21 : cases where T,C nt d = C n ,d, SC^d = C n ^ 

When proving equivalence of convexity and sos-convexity, it turns out to be more convenient to work 
with the second order characterization of sos-convexity, i.e., with the form g^i{x,y) = y T H{x)y in 
([6]). The reason for this is that this form is always quadratic in y, and this allows us to make use 
of the following key theorem, henceforth referred to as the "biform theorem" . 

Theorem 5.3 (e.g. |14j). Let f := f(u\,U2,V\,...,v m ) be a form in the variables u := (ui,U2) T 
and v := (vi, , . . . ,v m ) T that is a quadratic form in v for fixed u and a form (of any degree) in u 
for fixed v. Then f is psd if and only if it is sos0 

The biform theorem has been proven independently by several authors. See [13] and [6] for 
more background on this theorem and in particular [141 Sec. 7] for an elegant proof and some 
refinements. We now proceed with our proofs which will follow in a rather straightforward manner 
from the biform theorem. 

Theorem 5.4. £Ci a = C\ & for all d. EC^ d = C2,d for all d. 

Proof. For a univariate polynomial, convexity means that the second derivative, which is another 
univariate polynomial, is psd. Since £1^ = Pi jC i, the second derivative must be sos. Therefore, 
SCi^ = Ci : d- To prove EC^d = C^d, suppose we have a convex bivariate form p of degree d in 
variables x := (x\,X2) T . The Hessian H := H(x) of p is a 2 x 2 matrix whose entries are forms 
of degree d — 2. If we let y '■= (2/i,2/2) T , convexity of p implies that the form y T H(x)y is psd. 
Since y T H{x)y meets the requirements of the biform theorem above with {u\,v,2) = (^1,^2) an d 
(v\,V2) = (2/1,2/2)) it follows that y T H(x)y is sos. Hence, p is sos-convex. □ 

Theorem 5.5. £C ni 2 = Cn,2 for all n. £C n) 2 = C n ,2 for all n. 

Proof. Let x := (x\, ■ ■ ■ , x n ) T and y := (yi, . . . , y n ) T ■ Let p(x) = \x T Qx + b T x + c be a quadratic 
polynomial. The Hessian of p in this case is the constant symmetric matrix Q. Convexity of p 
implies that y T Qy is psd. But since S nj 2 = P n ,2, y T Qy must be sos. Hence, p is sos-convex. The 
proof of SC ni 2 = Cn,2 is identical. □ 

Theorem 5.6. EC 2 ,i = C2,4- 

Proof. Let p(x) := p(x\,X2) be a convex bivariate quartic polynomial. Let H := H(x) denote 
the Hessian of p and let y := (yi,y2) T - Note that H(x) is a 2 x 2 matrix whose entries are 
(not necessarily homogeneous) quadratic polynomials. Since p is convex, y T H(x)y is psd. Let 
H(xi,X2,X3) be a 2 x 2 matrix whose entries are obtained by homogenizing the entries of H. It 
is easy to see that y T H(xi, X2, x%)y is then the form obtained by homogenizing y T H(x)y and is 
therefore psd. Now we can employ the biform theorem (Theorem 15 .3|) with (ui,U2) = (2/1,2/2) an d 
(vi,V2,vs) = (xi,X2,xz) to conclude that y T H(xi,X2,X3)y is sos. But upon dehomogenizing by 
setting £3 = 1, we conclude that y T H(x)y is sos. Hence, p is sos-convex. □ 

Theorem 5.7 (Ahmadi, Blekherman, Parrilo [1]). EC3 t 4 = C%^. 

Unlike Hilbert's results ^2,4 = P2,4 and £3^ = ^3,4 which are equivalent statements and essen- 
tially have identical proofs, the proof of T^C^^ = C^^ is considerably more involved than the proof 
of SC2,4 = C2,a- Here, we briefly point out why this is the case and refer the reader to [1] for more 
details. 

5 Note that the results T,2,d = Pi,d and E„,2 = P n ,2 are both special cases of this theorem. 
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If p(x) := p(xi, X2, X3) is a ternary quartic form, its Hessian H (x) is a 3 x 3 matrix whose entries 
are quadratic forms. In this case, we can no longer apply the biform theorem to the form y T H{x)y. 
In fact, the matrix 



C(x) 



v x\ + 2x\ 



-xix 2 



-X1X2 



-X1X3 



-X2X3 



-xix 3 



-X2X3 xi + 2x\ 



due to Choi [12] serves as an explicit example of a 3 x 3 matrix with quadratic form entries that is 
positive semidefinite but not an sos-matrix; i.e., y T C(x)y is psd but not sos. However, the matrix 
C{x) above is not a valid Hessian, i.e., it cannot be the matrix of the second derivatives of any 
polynomial. If this was the case, the third partial derivatives would commute. On the other hand, 
we have in particular 

aCi,i(s) = Q , _ xs = dC lt3 (x) 
8x3 dxi 

It is rather remarkable that with the additional requirement of being a valid Hessian, the form 
y T H(x)y turns out to be psd if and only if it is sos pQ. 



5.2 Proofs of Theorems 15.11 and 15.2b cases where EC n ^ C C n ,di ^Cn,d C C n ^ 

The goal of this subsection is to establish that the cases presented in the previous subsection 
are the only cases where convexity and sos-convexity are equivalent. We will first give explicit 
examples of convex but not sos-convex polynomials/forms that are "minimal" jointly in the degree 
and dimension and then present an argument for all dimensions and degrees higher than those of 
the minimal cases. 



5.2.1 Minimal convex but not sos-convex polynomials /forms 

The minimal examples of convex but not sos-convex polynomials (resp. forms) turn out to belong 
to 62,6 \ SC2,6 and 63,4 \ TiC^^ (resp. \ ^C^fi and \ £64,4). Recall from Remark 15.11 
that we lack a general argument for going from convex but not sos-convex forms to polynomials 
or vice versa. Because of this, one would need to present four different polynomials in the sets 
mentioned above and prove that each polynomial is (i) convex and (ii) not sos-convex. This is a 
total of eight arguments to make which is quite cumbersome. However, as we will see in the proof 
of Theorem 15.81 and 15.91 below, we have been able to find examples that act "nicely" with respect 
to particular ways of dehomogenization. This will allow us to reduce the total number of claims 
we have to prove from eight to four. 

The polynomials that we are about to present next have been found with the assistance of a 
computer and by employing some "tricks" with semidefinite programming In this process, we have 
made use of software packages YALMIP [27], SOSTOOLS [38], and the SDP solver SeDuMi @7J, 
which we acknowledge here. To make the paper relatively self-contained and to emphasize the 
fact that using rational sum of squares certificates one can make such computer assisted proofs 
fully formal, we present the proof of Theorem 15.81 below in the appendix. On the other hand, 
the proof of Theorem 15.91 which is very similar in style to the proof of Theorem 15.81 is largely 

6 We do not elaborate here on how this was exactly done. The interested reader is referred to [5] Sec. 4], where a 
similar technique for formulating semidefinite programs that can search over a convex subset of the (non-convex) set 
of convex but not sos-convex polynomials is explained. The approach in [5] however does not lead to examples that 
are minimal. 
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omitted to save space. All of the proofs are available in electronic form and in their entirety at 
http : //aaa. lids .mit . edu/sof tware or at http : //arxiv . org/abs/1111 . 4587. 

Theorem 5.8. T,C 2 ^ is a proper subset of ' 6*2,6 • EC3.6 i$ a proper subset ofC^^. 

Proof. We claim that the form 

f(x 1 ,x 2 ,x 3 ) = 77xf - 155xf x 2 + M5xfx% + 7§x\x\ + 556xf x\ + ®&x\x\ + 240x!| - 9xfx 3 

-\\mx\x\x z + <o2x\x\x z + Yimx x x\x z - 343£ 2 , x 3 + 363xfx| + 773x?x 2 z§ 



-891xfx|x| - %mx x x\x\ + 1043x^§ - 14ufa;| - 1108xfx 2 x| - 216xi 



2^,3 



2" L "3 



-839x|x| + 72\x\x\ + 436xix 2 x| + 378^^ + 48xix| - 97x 2 x| + 89x| 
belongs to C 3i 6 \ SC 3i 6, and the polynomial 

f{x!,x 2 ) = f(xi,x 2 ,l - -x 2 ) (14) 

belongs to C 2> 6 \ SC 2j 6- Note that since convexity and sos-convexity are both preserved under 
restrictions to affine subspaces (recall Remark 13. 2p , it suffices to show that the form / in (|13p is 
convex and the polynomial / in (jl"4"|) is not sos-convex. Let x ■= (x\, x 2 , x 2 ) T , y := (yi, 2/2, 2/3) T , 
x := (xi,x 2 ) T , y := (yi,y 2 ) T , and denote the Hessian of / and / respectively by Hf and Hi. In 
the appendix, we provide rational Gram matrices which prove that the form 

(x 2 1 +x 2 2 )-y T H f {x)y (15) 

is sos. This, together with nonnegativity of x\ + x\ and continuity of y T HAx)y, implies that 
y T Hj{x)y is psd. Therefore, / is convex. The proof that / is not sos-convex proceeds by showing 
that Hj is not an sos-matrix via a separation argument. In the appendix, we present a separating 
hyperplane that leaves the appropriate sos cone on one side and the polynomial 

y T H f (x)y (16) 

on the other. □ 

Theorem 5.9. EC34 is a proper subset of 63,4. EC44 is a proper subset of C^. 



7 The polynomial f(xi,X2, 1) turns out to be sos-convex, and therefore does not do the job. One can of course 
change coordinates, and then in the new coordinates perform the dehomogenization by setting x$ = 1. 
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Proof. We claim that the form 



2-2 

.1 Q 



, . . . , x 4 ) = 1671acf - 4134xf x 2 - 3332x?x 3 + 5104x? x\ + 4989xf x 2 x 3 + 3490xf 

-2203xix| - 3030xix|x 3 - 3776xix 2 x| - 1522xix| + 1227x1 - 595xjjs 3 
+1859x|x| + 1146x 2 x| + 979x| + 1195728x| - 1932xix^ - 2296x 2 x| 



2J1 



2-M 



-3144x 3 x| + 1465xf x\ - 1376xfx 4 - 263xix 2 x| + 2790xf x 2 x 4 + 2121 
-292xix|x 4 - 1224x|x 4 + 2404xix 3 xl + 2727x 2 x 3 x| - 2852xix|x 4 
— 388x 2 x|x 4 — 1520x 3 x 4 + 2943xf x 3 x 4 — 5053xix 2 x 3 x 4 + 2552x2X 3 x 4 
+3512x|x| 

(17) 

belongs to C 4i4 \ £C 4i4 , and the polynomial 

h(x 1 ,x 2 ,x 3 ) = h(xi,x 2 ,x 3 , 1) (18) 

belongs to C 3>4 \ SC 3>4 . Once again, it suffices to prove that h is convex and h is not sos-convex. 
Let x := (xi, x 2 , x 3 , x 4 ) T , y := (y\, y 2 , y 3 , y 4 ) T , and denote the Hessian of /i and respectively by 
Hh and i?^. The proof that h is convex is done by showing that the form 

(x 2 2 + xl + xl)-y T H h (x)y (19) 

is sosJU The proof that h is not sos-convex is done again by means of a separating hyperplane. □ 

5.2.2 Convex but not sos-convex polynomials /forms in all higher degrees and dimen- 
sions 

Given a convex but not sos-convex polynomial (form) in n variables, it is very easy to argue that 
such a polynomial (form) must also exist in a larger number of variables. If p(x\, . . . , x n ) is a form 
in C n>d \ EC Ujd , then 

p(xi, x n+ i) = p(xi, . . . , x n ) + x^ +1 

belongs to C n +i d \ EC„+i^. Convexity of p is obvious since it is a sum of convex functions. The 
fact that p is not sos-convex can also easily be seen from the block diagonal structure of the Hessian 
of p: if the Hessian of p were to factor, it would imply that the Hessian of p should also factor. The 
argument for going from C n d \ £C ni d to C n+ i,d \ SC n+ i irf is identical. 

Unfortunately, an argument for increasing the degree of convex but not sos-convex forms seems 
to be significantly more difficult to obtain. In fact, we have been unable to come up with a natural 
operation that would produce a form in C n ^+2 \ SC ni d+ 2 from a form in C n ^ \ TiC n ^- We will 
instead take a different route: we are going to present a general procedure for going from a form 
in P n ,d \ S„ i( i to a form in C n d+2 \ ^Cn,d+2- This will serve our purpose of constructing convex 
but not sos-convex forms in higher degrees and is perhaps also of independent interest in itself. 
For instance, it can be used to construct convex but not sos-convex forms that inherit structural 
properties (e.g. symmetry) of the known examples of psd but not sos forms. The procedure is 
constructive modulo the value of two positive constants (7 and a below) whose existence will be 
shown nonconstructivelyH 



8 The choice of multipliers in (|15|l and (|19[) is motivated by a result of Reznick in [39] . 

9 The procedure can be thought of as a generalization of the approach in our earlier work in [5] . 
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Although the proof of the general case is no different, we present this construction for the case 
n = 3. The reason is that it suffices for us to construct forms in C 3) d \ £C3,d for d even and > 8. 
These forms together with the two forms in \ and 64,4 \ presented in (fT3|) and 

(|17p . and with the simple procedure for increasing the number of variables cover all the values of n 
and d for which convex but not sos-convex forms exist. 

For the remainder of this section, let x := (x\, X2, x 3 ) T and y := (yi,y2,U3) T ■ 

Theorem 5.10. Let m := m(x) be a ternary form of degree d (with d necessarily even and > 6) 
satisfying the following three requirements: 

Rl: m is positive definite. 

R2: m is not a sum of squares. 

R3: The Hessian H m of m is positive definite at the point (1,0,0) T . 

Let g := g(x2, x%) be any bivariate form of degree d + 2 whose Hessian is positive definite. 
Then, there exists a constant 7 > 0, such that the form f of degree d + 2 given by 

rxi rs 

f(x)= / / m(t,x 2 ,x 3 )dtds + ^g(x 2 ,x 3 ) (20) 
Jo JO 

is convex but not sos-convex. 

The form / in (|20p is just a specific polynomial that when differentiated twice with respect to 
x\ gives m. The reason for this construction will become clear once we present the proof of this 
theorem. Before we do that, let us comment on how one can get examples of forms m and g that 
satisfy the requirements of the theorem. The choice of g is in fact very easy. We can e.g. take 

1 \ / 2 2\ d+2 

9{X2,X 3 ) = {x 2 +x 3 ) 2 , 

which has a positive definite Hessian. As for the choice of m, essentially any psd but not sos ternary 
form can be turned into a form that satisfies requirements Rl, R2, and R3. Indeed if the Hessian 
of such a form is positive definite at just one point, then that point can be taken to (1, 0, 0) T by a 
change of coordinates without changing the properties of being psd and not sos. If the form is not 
positive definite, then it can made so by adding a small enough multiple of a positive definite form 
to it. For concreteness, we construct in the next lemma a family of forms that together with the 
above theorem will give us convex but not sos-convex ternary forms of any degree > 8. 

Lemma 5.11. For any even degree d > 6, there exists a constant a > 0, such that the form 

m{x) = xf~ 6 (xjx2 + x\x\ - ?>x\x\x\ + x%) + a(x\ + x% + x 2 3 )i (21) 
satisfies the requirements Rl, R2, and R3 of Theorem \5.KA 
Proof. The form 

24, 42 o222, 6 
12 ' x-^x 2 OX^X2X 3 

is the familiar Motzkin form in ([T|) that is psd but not sos |29j . For any even degree d > 6, the 
form 

d— 6/ 24, 42 o222, 6\ 

is a form of degree d that is clearly still psd and less obviously still not sos; see [3D]. This together 
with the fact that £ nj d is a closed cone implies existence of a small positive value of a for which 
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the form m in (|2ip is positive definite but not a sum of squares, hence satisfying requirements Rl 
and R2. 

Our next claim is that for any positive value of a, the Hessian H m of the form m in (|2ip satisfies 



JUi.o.o) 



d 
c 2 
c 3 



(22) 



for some positive constants cx,C2, C3, therefore also passing requirement R3. To see the above 
equality, first note that since m is a form of degree d, its Hessian H m will have entries that are 
forms of degree d — 2. Therefore, the only monomials that can survive in this Hessian after setting 
X2 and X3 to zero are multiples of xf~ 2 . It is easy to see that an xf~ 2 monomial in an off-diagonal 
entry of H m would lead to a monomial in m that is not even. On the other hand, the form m in 
(|2ip only has even monomials. This explains why the off-diagonal entries of the right hand side of 
(|22p are zero. Finally, we note that for any positive value of a, the form m in (|2ip includes positive 
multiples of o, and which lead to positive multiples of xf- 2 on the diagonal of 

H m . Hence, ci,C2, and C3 are positive. □ 

Next, we state two lemmas that will be employed in the proof of Theorem 15.101 



Lemma 5.12. Let m be a trivariate form satisfying the requirements Rl and R3 of Theorem \5.10l 

Let denote the Hessian of the form f Q Xl f Q s m(t,X2,x^)dtds. Then, there exists a positive con- 
stant 5, such that 

y T H lh {x)y > 

on the set 

S := {(x,y) I ||x|| = l,||y|| = 1, (x 2 2 + x 2 3 <5 ory 2 2 + yl< 5)}. (23) 
Proof. We observe that when y 2 + y 2 = 0, we have 

y T H fn (x)y = yfm(x), 



which by requirement Rl is positive when ||x|| = ||y|| = 1. By continuity of the form y H 7 f L {x)y, 
we conclude that there exists a small positive constant 5 y such that y T H^(x)y > on the set 

S y := {(x,y) I ||sc|| = 1, \\y\\ = 1, y\ + y\ < S y }. 

Next, we leave it to the reader to check that 

ffm(l,0,0) = 1 ff m (l,0,0). 

Therefore, when x\ + x 2 = 0, requirement R3 implies that y T H^ l (x)y is positive when = 
||y|| = 1. Appealing to continuity again, we conclude that there exists a small positive constant 
5 X such that y T H^ l {x)y > on the set 

Sx ■= {(x,y) I ||x|| = 1, \\y\\ = 1, x\ + x% < 5 X }. 

If we now take 5 = mm{5 y , 8 X }, the lemma is established. □ 

The last lemma that we need has already been proven in our earlier work in [5]. 
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Lemma 5.13 ([5]). All principal minors of an sos-matrix are sos polynomials] 10 ] 

We are now ready to prove Theorem 15.101 

Proof of Theorem \5.1(A We first prove that the form / in (|20p is not sos-convex. By Lemma 15.131 
if / was sos-convex, then all diagonal elements of its Hessian would have to be sos polynomials. On 
the other hand, we have from (|2Up that 

dm 

- m(x), 



dx\dx\ 

which by requirement R2 is not sos. Therefore / is not sos-convex. 

It remains to show that there exists a positive value of 7 for which / becomes convex. Let us 
denote the Hessians of /, J^ 1 m(t, X2, xs)dtds, and g, by Hj, H^, and H g respectively. So, we 
have 

H f (x) = H A (x) + jH g (x 2 , x 3 ). 

(Here, H g is a 3 x 3 matrix whose first row and column are zeros.) Convexity of / is of course 
equivalent to nonnegativity of the form y i H f(x)y. Since this form is bi-homogeneous in x and y, 
it is nonnegative if and only if y T Hf(x)y > on the bi-sphere 

B ■= {(x,y) I = 1, \\y\\ = 1}. 

Let us decompose the bi-sphere as 

B = SUS, 

where S is defined in (123p and 

S '■= {{x,y) I Ikll = 1, IMI = l,xj + xl> 8,y% + yl > 5}. 

Lemma T5.12I together with positive definiteness of H g imply that y T Hf(x)y is positive on <S. As 
for the set <S, let 

P\ = min_y T Hfh(x)y, 

and 

(3 2 = mm_y T H g (x2,x 3 )y. 
By the assumption of positive definiteness of H g , we have fa > 0. If we now let 

7> -5-, 
Pi 

then 

min y T H f (x)y > fa + ^fa > 0. 

Hence y T Hf(x)y is nonnegative (in fact positive) everywhere on B and the proof is completed. □ 

Finally, we provide an argument for existence of bivariate polynomials of degree 8, 10, 12, . . . 
that are convex but not sos-convex. 



As a side note, we remark that the converse of Lemma 15.131 is not true even for polynomial matrices that are 
valid Hessians. For example, all 7 principal minors of the 3x3 Hessian of the form / in (|13[1 are sos polynomials, 
even though this Hessian is not an sos-matrix. 
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Corollary 5.14. Consider the form f in A20\) constructed as described in Theorem \5.1(A Let 

f(xi,x 2 ) = f(xi,x 2 , 1). 

Then, f is convex but not sos-convex. 

Proof. The polynomial / is convex because it is the restriction of a convex function. It is not 
difficult to see that 

df(xi,x 2 ) , . 
— — = m(x 1 ,x 2 , 1), 

OX\OX\ 

which is not sos. Therefore from Lemma 15.131 / is not sos-convex. □ 

Corollary 15.141 together with the two polynomials in C 2 $ \ SC2.6 an d £^4 \ £6*3^ presented in 
(jl4j) and (|18|) , and with the simple procedure for increasing the number of variables described at the 
beginning of Subsection 15.2.21 cover all the values of n and d for which convex but not sos-convex 
polynomials exist. 

6 Concluding remarks and an open problem 

To conclude our paper, we would like to point out some similarities between nonnegativity and 
convexity that deserve attention: (i) both nonnegativity and convexity are properties that only hold 
for even degree polynomials, (ii) for quadratic forms, nonnegativity is in fact equivalent to convexity, 
(iii) both notions are NP-hard to check exactly for degree 4 and larger, and most strikingly (iv) 
nonnegativity is equivalent to sum of squares exactly in dimensions and degrees where convexity is 
equivalent to sos-convexity. It is unclear to us whether there can be a deeper and more unifying 
reason explaining these observations, in particular, the last one which was the main result of this 
paper. 

Another intriguing question is to investigate whether one can give a direct argument proving 
the fact that EC„ i( i = C n ^ if and only if EC n+ i i( 2 = C n+ i jC i- This would eliminate the need for 
studying polynomials and forms separately, and in particular would provide a short proof of the 
result £3,4 = C3 ; 4 given in pp. 

Finally, an open problem related to this work is to find an explicit example of a convex form 
that is not a sum of squares. Blekherman [7] has shown via volume arguments that for degree 
d > 4 and asymptotically for large n such forms must exist, although no examples are known. In 
particular, it would be interesting to determine the smallest value of n for which such a form exists. 
We know from Lemma 13.21 that a convex form that is not sos must necessarily be not sos-convex. 
Although our several constructions of convex but not sos-convex polynomials pass this necessary 
condition, the polynomials themselves are all sos. The question is particularly interesting from 
an optimization viewpoint because it implies that the well-known sum of squares relaxation for 
minimizing polynomials [46], [35j is not always exact even for the easy case of minimizing convex 
polynomials. 
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A Certificates complementing the proof of Theorem 15.81 

Let x := (x 1 ,x 2 ,x 2 ) T , y := {yi,y 2 ,yz) T , x := (x 1 ,x 2 ) T , y := {yi,y 2 ) T , and let f,f,H f , and Hj be 
as in the proof of Theorem 15.81 This appendix proves that the form (x\ + x^) • y T Hf(x)y in (|15p 
is sos and that the polynomial y T Hj{x)y in (|16p is not sos, hence proving respectively that / is 

convex and / is not sos-convex. 

A rational sos decomposition of (x\ + x 2 ) ■ y 1 H j(x)y, which is a form in 6 variables of degree 
8, is as follows: 

(x\ + x\) ■ y T H f (x)y = —z T Qz, 
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where z is the vector of monomials 

z = [x 2 xly 3 , x 2 x\y 2 ,x 2 x\yx,x\x 3 y 3 ,x\x 3 y 2 ,x\x 3 y x ,x\y 3 , x\y 2 ,x\y x , 

x\x\y 3n x 1 x\y 2 , x 1 xly 1 ,xiX2X 3 y 3 ,xiX2X 3 y2, xtx 2 x 3 yi, xixjy 3 , x\x\y 2 ,x\x\y x , 
xjx 3 y 3 ,xlx 3 y 2 , x\x 3 y x ,x\x 2 y 3 , x\ x 2 y2, x\x 2 yi, xfy 3 , x\y 2 , x\yi} T ', 
and Q is the 27 x 27 positive definite matrbi^l presented on the next page 

Q=[Qi Q2], 



11 Whenever we state a matrix is positive definite, this claim is backed up by a rational LDL T factorization of the ma- 
trix that the reader can find online at http: //aaa. lids .mit . edu/ software or at http : //arxiv . org/abs/1 111 .4587. 
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Next, we prove that the polynomial y T H^(x)y in (|16p is not sos. Let us first present this 
polynomial and give it a name: 

t(x, y) := y T H~ f (x)y = 294xix 2 y| - 6995x|?/i?/ 2 - 10200xiyi?/ 2 - 4356xf x 2 y\ - 2904xf y x y 2 

-\\Mhx x x\y\ + 13680x2^17/2 + 4764xix 2 y? + 4764xfyiy 2 + 6429xfx^yf 
+294xlyi2/ 2 - 13990xix^y| - 12123xfx 2 y| - 3872x 2 yiy 2 + ^x\y\ 
+20520xix|y| + 29076xix 2 yyi2/ 2 - 24246x1X^12/2 + 14901xix|yiy 2 
+15039xf x\y x y 2 + 8572xlx 2 y x y 2 + ^^x\x\y\ + 5013xf x 2 y^ 
+632yiy 2 - 12360x 2 y| - 5100x 2 y? + ^f^x\y\ + 7269x1?/? - 45025x1^ 
+ 7J W^4v 2 2 + 1J f±4vl ~ 1936x l2 / 2 2 - 84x iy f + ^flyl + 1442y? 
+7269xf y\ + 4356xf yf - 3825xf y\ - \mx\y\ + 2310x? y\ 

+5013xix|y? - 22950x? x 2 y x y 2 - 1505xf y x y 2 - m\\x\y\ - 3010xf x 2 yf . 

Note that t is a polynomial in 4 variables of degree 6 that is quadratic in y. Let us denote the cone 
of sos polynomials in 4 variables (x,y) that have degree 6 and are quadratic in y by £4,6) and its 
dual cone by S| 6 . Our proof will simply proceed by presenting a dual functional £ € X4 6 that 
takes a negative value on the polynomial t. We fix the following ordering of monomials in what 
follows: 

v = [vh 2/12/2, y\ , x 2 yl,x 2 yiy 2 ,x 2 yl, x\y\,x\y x y 2 ,x\y\, x\y\,x\y x y 2 ,x\y\, x\y\,x\y x y 2 ,x\y\, 
x\y\, ^12/12/2, ^1^22/1, x x x 2 y x y 2 , x x x 2 y x , x x x\y\, x x x\y x y 2 ,x x x\y\,x x x\y\, x x x\y x y 2 , 
x\x\y\, x\y\x\y x y 2 ,x\y\, x\x 2 y\, x\x 2 y x y 2 ,x\x 2 y\, x\x\y\, x\x\y x y 2 , x\x 2 y\, x\y\, 
x\y\y 2 ,x\y\, x\x 2 y\,x\x 2 y x y 2 , x\x 2 y\, x\y\, xfy x y 2 , x\y\] T . 

(24) 

Let t represent the vector of coefficients of t ordered according to the list of monomials above; i.e., 
t = tFv. Using the same ordering, we can represent our dual functional £ with the vector 

c = [19338, -2485, 17155, 6219, -4461, 11202, 4290, -5745, 13748, 3304, -5404, 13227, 3594, 

-4776, 19284, 2060, 3506, 5116, 366, -2698, 6231, -487, -2324, 4607, 369, -3657, 3534, 6122, 
659, 7057, 1646, 1238, 1752, 2797, -940, 4608, -200, 1577, -2030, -513, -3747, 2541, 15261, 
220,7834] T . 

We have 

. T ~ 364547 

(C,t )=c T t = — - <0. 

lb 

On the other hand, we claim that £ 6 S| 6 ; i.e., for any form w £ £4,6, we should have 

(£,w)=c T w>0, (25) 

where w here denotes the coefficients of w listed according to the ordering in (|24p . Indeed, if w is 
sos, then it can be written in the form 

w(x) = z T Qz = Tr Q ■ zz T , 
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for some symmetric positive semidefinite matrix Q, and a vector of monomials 

z = [V2, 2/1, x 2 y 2 , x 2 y x , x x y 2 , x x y x , x\y 2 ,x\y x ,x x x 2 y 2 , x x x 2 y x , x\y 2 ,x\y x ) T . 
It is not difficult to see that 

c T w = TvQ ■ (zz T )\ c , (26) 

where by (zz T )\ c we mean a matrix where each monomial in zz T is replaced with the corresponding 
element of the vector c. This yields the matrix 



■ 19338 


-2485 


6219 


-4461 


2060 


3506 


4290 


-5745 


366 


-2698 


6122 


659" 


-2485 


17155 


-4461 


11202 


3506 


5116 


-5745 


13748 


-2698 


6231 


659 


7057 


6219 


-4461 


4290 


-5745 


366 


-2698 


3304 


-5404 


-487 


-2324 


1646 


1238 


-4461 


11202 


-5745 


13748 


-2698 


6231 


-5404 


13227 


-2324 


4607 


1238 


1752 


2060 


3506 


366 


-2698 


6122 


659 


-487 


-2324 


1646 


1238 


-200 


1577 


3506 


5116 


-2698 


6231 


659 


7057 


-2324 


4607 


1238 


1752 


1577 


-2030 


4290 


-5745 


3304 


-5404 


-487 


-2324 


3594 


-4776 


369 


-3657 


2797 


-940 


-5745 


13748 


-5404 


13227 


-2324 


4607 


-4776 


19284 


-3657 


3534 


-940 


4608 


366 


-2698 


-487 


-2324 


1646 


1238 


369 


-3657 


2797 


-940 


-513 


-3747 


-2698 


6231 


-2324 


4607 


1238 


1752 


-3657 


3534 


-940 


4608 


-3747 


2541 


6122 


659 


1646 


1238 


-200 


1577 


2797 


-940 


-513 


-3747 


15261 


220 


. 659 


7057 


1238 


1752 


1577 


-2030 


-940 


4608 


-3747 


2541 


220 


7834. 



which is positive definite. Therefore, equation (|26p along with the fact that Q is positive semidefinite 
implies that (f25j) holds. This completes the proof. 



25 



